AITopics | false question

Collaborating Authors

false question

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Species196: A One-Million Semi-supervised Dataset for Fine-grained Species Recognition Wei He, Kai Han

Neural Information Processing SystemsFeb-15-2026, 19:20:29 GMT

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China (0.05)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)

Add feedback

8cb92f326d01fd7f4371283ee2fa6386-Paper-Datasets_and_Benchmarks.pdf

Neural Information Processing SystemsOct-9-2025, 01:03:38 GMT

large language model, machine learning, natural language, (18 more...)

Neural Information Processing Systems

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China (0.05)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (0.46)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)

Add feedback

b51a15f382ac914391a58850ab343b00-Supplemental.pdf

Neural Information Processing SystemsAug-17-2025, 00:37:17 GMT

artificial intelligence, false question, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.71)

Add feedback

Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble

Kumthekar, Amit, Tilley, Zion, Duong, Henry, Patel, Bhargav, Magnoli, Michael, Omar, Ahmed, Nasser, Ahmed, Gharpure, Chaitanya, Reztzov, Yevgen

arXiv.org Artificial IntelligenceJun-24-2025

Despite the growing clinical adoption of large language models (LLMs), current approaches heavily rely on single model architectures. To overcome risks of obsolescence and rigid dependence on single model systems, we present a novel framework, termed the Consensus Mechanism. Mimicking clinical triage and multidisciplinary clinical decision-making, the Consensus Mechanism implements an ensemble of specialized medical expert agents enabling improved clinical decision making while maintaining robust adaptability. This architecture enables the Consensus Mechanism to be optimized for cost, latency, or performance, purely based on its interior model configuration. To rigorously evaluate the Consensus Mechanism, we employed three medical evaluation benchmarks: MedMCQA, MedQA, and MedXpertQA Text, and the differential diagnosis dataset, DDX+. On MedXpertQA, the Consensus Mechanism achieved an accuracy of 61.0% compared to 53.5% and 45.9% for OpenAI's O3 and Google's Gemini 2.5 Pro. Improvement was consistent across benchmarks with an increase in accuracy on MedQA ($Δ\mathrm{Accuracy}_{\mathrm{consensus\text{-}O3}} = 3.4\%$) and MedMCQA ($Δ\mathrm{Accuracy}_{\mathrm{consensus\text{-}O3}} = 9.1\%$). These accuracy gains extended to differential diagnosis generation, where our system demonstrated improved recall and precision (F1$_\mathrm{consensus}$ = 0.326 vs. F1$_{\mathrm{O3\text{-}high}}$ = 0.2886) and a higher top-1 accuracy for DDX (Top1$_\mathrm{consensus}$ = 52.0% vs. Top1$_{\mathrm{O3\text{-}high}}$ = 45.2%).

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2505.23075

Country: Asia > East Asia (0.04)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area > Nephrology (1.00)
Health & Medicine > Therapeutic Area > Hematology (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.34)

Add feedback

Mitigating Sycophancy in Decoder-Only Transformer Architectures: Synthetic Data Intervention

Wang, Libo

arXiv.org Artificial IntelligenceNov-20-2024

To address the sycophancy problem caused by reinforcement learning from human feedback in large language models, this research applies synthetic data intervention technology to the decoder-only transformer architecture. Based on the research gaps in the existing literature, the researcher designed an experimental process to reduce the tendency of models to cater by generating diversified data, and used GPT4o as an experimental tool for verification. The experiment used 100 true and false questions, and compared the performance of the model trained with synthetic data intervention and the original untrained model on multiple indicators. The results show that the SDI training model supports the technology in terms of accuracy rate and sycophancy rate and has significant effectiveness in reducing sycophancy phenomena.

arxiv preprint arxiv, language model, sycophancy, (9 more...)

arXiv.org Artificial Intelligence

2411.10156

Country:

Asia > Malaysia > Kuala Lumpur > Kuala Lumpur (0.04)
Europe > Poland > Kuyavian-Pomeranian Province > Toruń (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Exploring the Comprehension of ChatGPT in Traditional Chinese Medicine Knowledge

Yizhen, Li, Shaohan, Huang, Jiaxing, Qi, Lei, Quan, Dongran, Han, Zhongzhi, Luan

arXiv.org Artificial IntelligenceMar-14-2024

No previous work has studied the performance of Large Language Models (LLMs) in the context of Traditional Chinese Medicine (TCM), an essential and distinct branch of medical knowledge with a rich history. To bridge this gap, we present a TCM question dataset named TCM-QA, which comprises three question types: single choice, multiple choice, and true or false, to examine the LLM's capacity for knowledge recall and comprehensive reasoning within the TCM domain. In our study, we evaluate two settings of the LLM, zero-shot and few-shot settings, while concurrently discussing the differences between English and Chinese prompts. Our results indicate that ChatGPT performs best in true or false questions, achieving the highest precision of 0.688 while scoring the lowest precision is 0.241 in multiple-choice questions. Furthermore, we observed that Chinese prompts outperformed English prompts in our evaluations. Additionally, we assess the quality of explanations generated by ChatGPT and their potential contribution to TCM knowledge comprehension. This paper offers valuable insights into the applicability of LLMs in specialized domains and paves the way for future research in leveraging these powerful models to advance TCM.

chatgpt, explanation, knowledge, (15 more...)

arXiv.org Artificial Intelligence

2403.09164

Country:

Asia > China > Beijing > Beijing (0.05)
North America > United States (0.04)
Oceania > Australia (0.04)
North America > Canada (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Education (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Species196: A One-Million Semi-supervised Dataset for Fine-grained Species Recognition

He, Wei, Han, Kai, Nie, Ying, Wang, Chengcheng, Wang, Yunhe

arXiv.org Artificial IntelligenceOct-28-2023

The development of foundation vision models has pushed the general visual recognition to a high level, but cannot well address the fine-grained recognition in specialized domain such as invasive species classification. Identifying and managing invasive species has strong social and ecological value. Currently, most invasive species datasets are limited in scale and cover a narrow range of species, which restricts the development of deep-learning based invasion biometrics systems. To fill the gap of this area, we introduced Species196, a large-scale semi-supervised dataset of 196-category invasive species. It collects over 19K images with expert-level accurate annotations (Species196-L), and 1.2M unlabeled images of invasive species (Species196-U). The dataset provides four experimental settings for benchmarking the existing models and algorithms, namely, supervised learning, semi-supervised learning, self-supervised pretraining and zero-shot inference ability of large multimodal models. To facilitate future research on these four learning paradigms, we conduct an empirical study of the representative methods on the introduced dataset. The dataset is publicly available at https://species-dataset.github.io/.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2309.14183

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Asia > China (0.05)
North America > United States > California > San Diego County > San Diego (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Fourth International Workshop on Artificial Intelligence and Statistics

AI MagazineJan-4-2018, 13:45:29 GMT

The workshops on Artificial Intelligence and Statistics have broadened the flow of information between the two fields and encouraged interdisciplinary work. The workshop is in English and includes one day of tutorials and three days of focussed poster sessions, presentations and panels. The presentations are scheduled in the mornings and evenings, leaving the afternoons free for discussions in more relaxed environments. The workshop will be held at the Pier 66 Resort and Marina - a 22 acre fullfeatured resort located on the intracoastal waterway. Secondly, the class soon realizes that sexual differences are much reduced from Turing's time-women know about sports and men often can answer cooking questions.

question, space agency, us government, (16 more...)

AI Magazine

Industry:

Government > Regional Government > North America Government > US Government (0.54)
Government > Space Agency (0.35)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback